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^ c| Abstract 

The classical capacity of a quantum channel with arbitrary Marko- 
vian correlated noise is evaluated. For the general case of a channel 
with long-term memory, which corresponds to a Markov chain which 
Q" 1 does not converge to equilibrium, the capacity is expressed in terms 

of the communicating classes of the Markov chain. For an irreducible 
and aperiodic Markov chain, the channel is forgetful, and one retrieves 
| the known expression [TS] for the capacity. 
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1 Introduction 



Shannon, in his celebrated Noisy Channel Coding Theorem [22] , obtained an 
explicit expression for the channel capacity of discrete, memorylesa3, classical 
channels. The first rigorous proof of this fundamental theorem was provided 
by Feinstein |8j. He used a packing argument (see e.g. [HI] ) to find a lower 
bound to the maximal number of codewords that can be sent through the 
channel reliably, i.e., with an arbitrarily low probability of error. More pre- 
cisely, he proved that for any given 5 > 0, and sufficiently large number, n, of 
uses of a memoryless classical channel, the lower bound to the maximal num- 
ber, N n , of codewords that can be transmitted through the channel reliably, 
is given by 

Here H(X : Y) is the mutual information of the random variables X and 
Y, corresponding to the input and the output of the channel, respectively. 
This lower bound implies that for n large enough, any real number R < 
C = max H(X : Y), (the maximum being taken over all possible input 
distributions), at least N n = [2 nR ] classical messages can be transmitted 
through the channel reliably. In other words, any rate R < C is achievable. 

The assumption that noise is uncorrelated between successive uses of a 
channel is not realistic. Hence memory effects need to be taken into account. 
In this paper we consider the transmission of classical information through a 
class of quantum channels with memory. The first model of such a channel 
was studied by Macchiavello and Palma [17]. They showed that the trans- 
mission of classical information through two successive uses of a quantum 
depolarising channel, with Markovian correlated noise, is enhanced by using 
inputs entangled over the two uses. A more general model of a quantum chan- 
nel with memory was introduced by Bowen and Mancini [3] and also studied 
by Kretschmann and Werner |15j . In particular, in [T5] , the capacities of 
a class of quantum channels with memory, the so-called forgetful channels 
were evaluated. Similar results were obtained by Bjelakovic and Boche [2]. 
Further, in [7], the classical capacity of a class of quantum channels with 
long-term memory was obtained. The memory of the channel considered in 
[7] can be viewed as a special case of a general Markovian memory, where 
the Markov chain is aperiodic but not irreducible, and hence does not con- 
verge to equilibrium. Recently, there was a generalization of the result of 
[7] by Bjelakovic and Boche, who in [3] obtained the classical capacities of 
compound and averaged quantum channels. 

1 For such a channel, the noise affecting successive input states, is assumed to be 
perfectly uncorrelated. 
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Another interesting special case of a channel with long-term memory is 
that in which the memory is described by a periodic Markov chain. A simple 
example of this is a channel given by alternating applications of two com- 
pletely positive trace preserving (CPT) maps $1 and $2, with the first map 
being $1 or $2 with probability 1/2. 

In this paper we study channels with arbitrary Markovian correlated noise. 
This includes, in particular, the above special cases. We show that the ca- 
pacity in the general case can be expressed in terms of the communicating 
classes of the underlying Markov chain. 

We start the main body of our paper with some preliminaries in Section 
El In Section [31 the quantum channel is defined and its capacity is stated in 
the main theorem, Theorem [TJ of this paper. In Section HI we prove a special 
case of the direct part of this theorem, corresponding to a Markov chain 
which converges to equilibrium and is hence forgetful. This section therefore 
provides an alternative proof of the result of Kretschmann and Werner [15] 
for the classical capacity of such a channel. This proof is extended to the 
case of an arbitrary Markov chain in Section [5j In the latter, we employ the 
idea of adding a preamble to the codewords (as was done in [TJ) in order 
to distinguish between the different communicating classes of the Markov 
chain. The proof of the (weak) converse part of our main result (Theorem 
CD) is given in Section El 

2 Mathematical Preliminaries 

Let TC and K be given finite-dimensional Hilbert spaces and denote by B(TC) 
the algebra of linear operators on TC. We also consider the tensor product 
algebras A n = B{TL® n ) and the infinite tensor product C*-algebra obtained 
as the strong closure 



where we embed A n into An+i in the obvious way. Similarly, we define 
B n = B{JC® n ) and B^. A state on an algebra A is a positive linear functional 
on A with 0(1) = 1, where 1 denotes identity operator. If A is finite- 
dimensional then there exists a density matrix (i.e., a positive operator 
with Trp^ = 1) such that 4>(A) = Tr(p^A), for any A G A. We denote the 
states on Aoo by iS(^4oo), those on A n by <S(*4 n ),etc. 
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3 A quantum channel with classical memory 



Let there be given a Markov chain on a finite state space / with transition 
probabilities {qa>}i,i'ei an d let be an invariant distribution for this 

chain, i.e. 

%' =5^7i?«'- ( 2 ) 

iei 

Moreover, let $j : B(H) — > £>(/C) be given completely positive trace-preserving 
(CPT) maps for each i £ I. Then we define a quantum channel with Marko- 
vian correlated noise, by the CPT map $oo : S(Aoo) — > S(Boo) on the states 
of Aoo by 

($ OO )(0)(A)= 7nfei 2 ---^-xi„Tr [($ il ®---®$ in )(p 4>n )A\ (3) 

il,...,i„El 

for A G £>„. Here, n is the restriction of (f> to and p0 n its density matrix. 
It is easily seen, using the property (EJ), that this definition is consistent and 
defines a CPT map on the states of Aoo, and moreover, that it is translation- 
invariant (stationary). 

We denote the transpose action of the restriction of $oo to S(A n ) by 
$(n) . js^ n ) -> B{K,® n ), i.e., 

Ti(* w (^M) = (^))W. 

for a density matrix G B{Ti® n ), <p G 5(A)- 
Note that 

$(»)(,(»)) = £ 7n^ 2 ---^ l ( < fn®---®^J(P (n) )- (4) 

ii,...,i„el 

Let us consider the transmission of classical information through $("■). 
Suppose Alice has a set of messages, labelled by the elements of the set 
Jv[ n = {1,2,..., M n }, which she would like to communicate to Bob, using 
the quantum channel $. To do this, she encodes each message into a quantum 
state of a physical system with Hilbert space H,® n , which she then sends to 
Bob through n uses of the quantum channel. In order to infer the message 
that Alice communicated to him, Bob makes a measurement (described by 
POVM elements) on the state that he receives. The encoding and decoding 
operations, employed to achieve reliable transmission of information through 
the channel, together define a quantum error correcting code (QECC). More 
precisely, a code of size N n is given by a sequence {p\ n \ E^}^ where 
each is a state in B(H® n ) and each E\ n) is a positive operator acting in 
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/C 0r \ such that Et"^ < lW - Here ' l(n) denotes the identity operator 
in £(/C® n ). Defining = 1&* - E£\ ^f , Y iel ds a Positive Operator- 
Valued Measure (POVM) {Ei n) }^ in /C® n . An output i > 1 would lead to 
the inference that the state (or codeword) pf 1 ^ was transmitted through the 
channel <&( n ) , whereas the output is interpreted as a failure of any inference. 
The average probability of error for the code is given by 

n i=l 

If there exists an n G N such that for all n > n , there exists a sequence of 
codes {C (n) }£° =1 , of sizes N n > 2 nR , for which P e (C {n) ) -> as n -> oo, then 
is said to be an achievable rate. 

The classical capacity of $ is defined as 

C($):=sup J R, (6) 

where R is an achievable rate. 

Let C be the set of communicating classes, C, of the Markov chain [19] 
for which 

Any other classes can be disregarded. For C G C we define 

«&£>(,<»>) := J_ £ 7n^ 2 ---fc-^(*n®"--®^J(P (n) ), (8) 

il,...,i n &C 

which represents the restriction of the classical memory of the channel to 
the class C. Notice that the Markov chain restricted to C G C is necessarily 
irreducible, and is either aperiodic or periodic with a single period. In fact, 

C = C liC 

where C aper denotes the set of communicating classes in C which are aperiodic, 
while C per denotes the set of communicating classes in C which are aperiodic. 

If C G C aper , we define, for any ensemble {pi-, Pj } of states on TL® n , the 
mean Holevo quantity for the class C as 



X ( c\{pf\pf}): ' 



n 



(9) 
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If C G C per is periodic, with period L, then C = {io,ii, ■ ■ ■ , il-i) f° r certain 
i , . . . , i L -\ e I, and q ikik+1 = 1 for k = 0, . . . , L - 2 and q iL _ lio = 1. In this 
case, 

n = \ic (ieC) (10) 



and we set 



^({p^}) = ^E^Scfrf.^}), 



i6C 



(12) 



where for 6 {0, 1, . . . , L — 1}, 

xg fe ({p?U}) = ^ K ® • • • ® V.Ip'"')) - 4 n) > 

(the indices in the subscripts being taken modulo L), with 
pin) = J^pWpW, and ^ = Epf 5 (^®*W ■ • -®*Wx(^ B) ))- (13) 



Our main result is the following theorem. We use the standard notation 
A for minimum and V for maximum. 

Theorem 1 The classical capacity of a quantum channel with arbitrary Marko- 
vian correlated noise, defined by is given by 



C(Q) = lim sup 



( in] m -, 



.cec 



(14) 



The existence of the limit in (|14|) is proved in Lemma [221 of Appendix A. 

Before proving Theorem H] we consider the special case in which the 
Markov chain has a single communicating class, and the latter is aperiodic 
and irreducible. 



4 Ergodic memory case 

In this section we assume that the underlying Markov chain is aperiodic and 
irreducible (see e.g. [19]) so that in particular, the invariant distribution, 
is unique. It is well-known that the corresponding Markov chain is 
ergodic and consequently the output states of the channel are also ergodic. 
In this case, the Markov chain satisfies the property of convergence to equi- 
librium, i.e., 

in) 

Pii as n ^ oo, 
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where p\, denotes the n-step transition probability from the state % to the 
state j, G I). This implies that the correlation in the noise, acting on 
successive inputs to the channel, dies out after a sufficiently large number 
of uses of the channel. Hence, in this case the channel belongs to the class 
of channels introduced and studied by Kretschmann and Werner [IS], and 
referred to as forgetful channels. 

Suppose that {pf\ Pj }/=i ^ s a sec L uence of states given by density ma- 
trices pj on H® n with probabilities , Yl"j=i Pj = 1- 

The Holevo quantity for the channel restricted to A n is given by 

x({pf\^ n) (p ( ; i) )}) = s ( j^pf^Kpf) ) - Epfs^(pf)) (i) 

\J'=1 / 3=1 

The classical capacity of a quantum channel with classical ergodic memory 
is stated in the following theorem, which is a special case of Theorem [U 

Theorem 2 The classical capacity of a quantum channel with memory de- 
fined by (dp, where the underlying Markov chain is aperiodic and irreducible, 
is given by 

**(<&)= lim- sup x({pJ B \* (B) (pf)}) (2) 

The existence of the limit in (j2J) is proved in Lemma [221 of Appendix A. 

This expression for the capacity was in fact stated and proved in [IS] . 
We present an alternative proof which can then be extended to the case of a 
general Markov chain. The latter is done in Section [SJ 

The direct part of Theorem El i.e., the achievability of any rate R < 
follows from Lemma [1] given below, which is itself a generalization of 
the Quantum Feinstein Lemma for a memoryless channel [7] . The weak 
converse part of Theorem El is proved in the general case in Section [HI 



4.1 Quantum version of Feinstein's Lemma 



Lemma 1 Let denote a quantum memory channel with Markovian cor- 
related noise, defined by (T3|). Suppose that the Markov chain is aperiodic 
and irreducible. Let x* = X*{&) be given by (TJ|). Given e > 0, there 
exists uq E N such that for all n > uq there exist at least N > 2 n ^ x *~ e ^ 



states with density matrices p^' , . . . , p$ G B{7i® n ), and positive operators 
E[ n \ . . . , G B(1C® n ) such that £f =1 < 1^ and 



Tr 



~(n) 
Pk 



E 



in) 



> 1 



e. 



(3) 
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Proof. Choose l so large that 



- sup 



x({p!°\® {1o) (p! o) )})-x* 



1 



(4) 



Then assume that the supremum is attained for an ensemble {Pj , pf }j = u 
for a finite J. 

Denote for m G N 

where 



$Mo) (j)f m ^ 



ft _v> ] >) 

i=i 

These states form a compatible system of states on {£> m ; }^ =1 and hence a 
state 0oo on by 

^(A) = Tr {a ml °A) (7) 

if A G i3 m ; . This state is clearly Z -periodic, i.e. invariant under translations 
over multiples of Iq . Therefore, the mean entropy 

S M {M ■= hm -S (a ml °) = inf -S (a ml °) (8) 

exists. 

For / sufficiently large, the mean entropy Sm(0oo) is close to S (o7 ), the 
von Neumann entropy of the average output of Iq uses of the channel. This 
is stated in the following lemma. 

Lemma 2 Given e > there exists L > such that for l > L, 



(5) 
(6) 



e 

< - 



(9) 



Here 0^ is given by (J7J) . The proof is similar to that of Lemma [21 

Henceforth Iq is fixed to a value such that Lemma [5] and (j3J) hold. For 

notational simplicity, explicit dependence on l is often suppressed. 
The proof of Lemma Q] requires the sequence of lemmas given below. 

Lemma 3 The state 0oo is strongly clustering and hence completely ergodic 
for l -shifts, i.e., for any A,B G B m i , 

lim ^{Ar kl °{B)) = Tr (a mlo A) Tr {a mlo B). (10) 

k— >oo 
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Proof The proof is standard and relies on the fact that the expectations 
of A and B in the state 0^ decouple as their supports are separated by a 
sufficiently large distance. This is because 

Yl QiM---Qik-iik9{ik) ^^2li9(i), (11) 

as k — > oo, for any function g(i), since the Markov chain is irreducible and 
aperiodic. □ 
In the following we denote /C®'° by /Q . We also use the following lemma, 
which is proved in Appendix B. 

Lemma 4 For any 5 > 0, there exists mi G N such that for all m > mi 
there exists a subspace Te C JCf Q m with projection P m i such that 

Pmlo <Tmh Pml < 2- m ^^-W^ (12) 

and 

Tr (a mlo P mlo ) >l-5 2 . (13) 
Here l( m '°) denotes the identity operator in i3(/C® m '°). 

In order to obtain the first term in the expression ([2]) for the capacity, we 
need to be able to replace Sm(0oo) in the above lemma by S(&i ). This is 
possible due to Lemma EJ 

We need an analogous result to Lemma H] for the second term in the 
expression (j2J) of This is stated in Lemma [6] (which is proved in 

Appendix C). It uses Lemma El given below. To formulate these lemmas, we 
define density matrices £ m / in algebras 

j 

M ml0 = B(K%») 

jl,-jm=l 

by 

z ml0 = pf*w ® • • • ® p m) , (14) 

jit— ,3m 

where p^ = Yl™ =1 pf° and p^, j G {1, 2, . . . , J}, belongs to the maximising 
ensemble (c.f. (BJ). In the following we denote p^ ml °^ = <g> • • ■ <g> p^, with 
3 = U1J2, ■ ■ ■ , jm), for any m G N. 
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Lemma 5 There exists a unique translation-invariant state ip^ on Ai^ = 
Um=i M m i such that 

^(A) = Tr (Z ml0 A) (15) 

for A G A4 m i . Moreover, this state is strongly clustering and therefore 
completely ergodic. 

Proof. The proof of this lemma is similar to that of Lemma El □ 
Note that the mean entropy of ^oo is given by 

SWOAoc) = lim jS(Y,ki ), (16) 

k— >oo K 

where 

spuo) = E -s , (pf* (Wo) (pS ,> ®---®^ ) )) 

3lt—Jk 

= E^f^( $(Wo) ^f o) ))+^(^ 0) }/=i)- ( 17 ) 

i 

We define 

5m - 5 M ({p?\ pf >}) := fim I E>f S (* (Wo) (pf 0) )) • (18) 

j 

Lemma 6 Given 5 > 0, there exists rri2 G N such that for all m > ni2 there 
exist, for all j = (ji, . . . ,j m ) G {1, . . . , J} m , one- dimensional subspaces 

o//C® m (indexed by k in some set ) with projections tt^^ in the j-th 

component of A4 m i , such that for all k G Tj™' , 



1 

m 



logu,^ +S M {{pf\pf ) }) 



< 



(19) 



where u\ J o) = Tr ($( mi o)( p H)),H)j ? flnd 



/ 



\ 



> i - s 2 . 



(20) 



Proof See Appendix C. 
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We now continue the proof of the theorem. In the following we denote 

P Mo) = ^Mo) (21) 



The remainder of the proof is in fact analogous to that for the case of a 
memoryless channel (see [6], [7j), so we only sketch the main steps. 



For arbitrary n, let m = [n/l ] and denote, IT n = P m i ® l n - m i , ni- 



p{mi ) ^ i n _ mlo an j p(n) _ Tr (?n+1)/o _ n a-( m+ i)/ . Now let N = N(n) be the 
maximal number of states p\ , . . . , pff on 7-t® n for which there exist positive 
operators E[ n) , . . . , on /C® n , of the form E^ n) = Sf° ® l n -mi , such that 

« Ef=i4 n) <nnand 

(ii) Tr[^ } 4 n) ]>l- e and 

(hi) Tr [a^E^] < 

Here = ^(#). 

For any given j G {1, . . . , J}" 1 define 

(AT \ 1/2 / AT \ X /2 

n «-I>n nnnfnJn n -g4"M . (22) 

Clearly, < f[ n — Ylk=i ^k^-i anc ^ we a ^ so nave: 
Lemma 7 There exists an n\ G N szic/i i/iat ifn>n\ then 

Tr (a (n V/ n) ) < 2- n ^(*)-i e ], (23) 

/or a// j . 

Proof. Let Q n = Ylk=i -^k ■ Note that Q n is of the form Q n = Q m i ® 
ln-mio! since = S™ 10 ® l„_ mio . Note that Q n commutes with tl n by 
condition (i). Now, by Lemma HJ we have 

PmloVmloPmlo < 2- m ^^~^l^ (24) 

and, assuming that lo > L, we have by Lemma El 

< 2~ n[ ^ Si ® {l ° ){p)) ~ lie] l {mlo) (25) 
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provided 



^—^S(^\p)) <-(n-m- ~mZ )e, 

L 4 I 



(26) 



which holds if Iq > 6 and 



12 

m > — log dim /C 



since ±S(& l °\p)) < log dim/C. 
Using this, we get 



Tr (a {n) vj 



Pmlo&mloPmlQ^PmlQ Qmlo) ^ Pj {Pfnlo Qmlo) ^ 



= Tr 
= Tr 
= Tr 

< 2 

< 2 ^ 5( * (i0)( ^ )) "^ l Tr (P 
However, by Lemma El 



n n a^n n (n n - Q„) 1/2 nf } (n n - g n ) 1/2 

(mj ) 

_„[ ] L 5 ($( i o)(p))_Ie] Tr 



{Pmlo Qmlo) ^ Pj {Pmla Qmlo) ^ 



(ml )\ 



(27) 



Tr (p (mio) ) < 2 m[§M( - {p< ] 0) ' p 3 O)})+ ^ ] 



< 2 «[^5„({ P ^ ) ,p< i « ) }) + |e] 

< 2 n[iE,pF 0) ^ 0) )+| e ] 



(28) 



where the last inequality follows from the subadditivity of the von Neumann 



entropy. The lemma now follows from 
Since N(n) is maximal it follows that 



Corollary 1 



Tr IffW 9 ' 



E Tr 



< 1 - e. 



< 1 - e. 



Lemma 8 Assume rj > 35. Then for all n > n 2 = rriil V m 2 lc 



E ( Tr 



> 1 - T]. 



□ 

(29) 
(30) 

(31) 
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Proof. We write 
E(Tr 



u j L1 n L1 j LL n 



W n (n) 
3 3 



E (Tr 

— E [ Tr 



E ( Tr 



oj n) (i-n n )n$. 



( n )fr rrH 



a- U n U)' l '(l-U r 

(ml ) ^> (ml )^ 



(32) 



The first term equals E[Tr (a ™ 'p) m ? which by Lemma[6]is > 1 — 5 2 , 
provided n > n 2 . 
Note that 



E [Tr 
and similarly 
E f Tr 



af(i-n n )n 



(n) 
3 



E Tr 



(ml ) 



(33) 



E Tr 



^ Mo) P mio ^ mtOJ (l-^o) ) (34) 



,(m/ ) 



Using ( |33i) and ( 13^1) , the last two terms on the right hand side of ( l32i) can be 
bounded using Cauchy-Schwarz and Lemma H] as follows : 



E Tr 



of (i -fi^n 



and 



E Tr 



ofn n nf (i-n n ) 



< <5 



< 5 



(35) 



(36) 



provided n > n\. Choosing n.3 = n± V n 2 and <5 2 + 2<5 < 77 the result follows. 
□ 

Lemma 9 Assume r\ < \e and r\ > 35. Then for n > n% = n\ V n 2 , 



Tr 



N 
fc=l 



E Tr 



A? 



fc=i 



> 



(37) 



Proof. Define 



g; = n„-(n n -g n ) 1/2 . 

By the above corollary, 

i-e > e{tt (of (n n -g;)nf (p^-g;))} 



EMi [of fijlf fi, 



-E{Tr (of g;nfn„) +Tr (of n n nf g;)} 
+E{ Tr (ofg;nfg;)}. 



(3? 



(39) 
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Since the last term is positive, we have, by Lemma [8], 

E {Tr (<xf Q'jlf P n ) + Tr (af PJlf Q' n ) }>e- V >2 V . (40) 

On the other hand, using Cauchy-Schwarz for each term, the left-hand side 
is bounded by 

2 { E [Tr (afQ'„ 2 )]} 1/2 . (41) 



Thus, 



E 



Tr (af <%) 



> 



To complete the proof, we now claim that 

Qn > (Q'rf- 



(42) 



(43) 



Indeed, this follows on the domain of P n from the inequality 1 — (1 — x) 2 > x 2 
for < x < 1. □ 
To complete the proof of the theorem, we now have by assumption, 



Tr 



(44) 



for all k — 1, ... , N(n). On the other hand, choosing r\ < |e and 5 < |r/, we 
have by Lemma [HI 



Tr 



k=l 



> V 



provided n> n 3 . It follows that 

N(n) > 7? 2 2 "[x*(*)-!e] > 2™[x*(*)- £ ] 

for n > n 3 and n > — - log?]. 



(45) 

(46) 
□ 



5 The case of a general Markov chain 

In the following we write 



l k + n-l 



(47) 



if % = Zfc is in a periodic class C, and where the labelling is modulo the length 
of the class. 
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5.1 The direct part of Theorem [T] 



In this section we prove the direct part of Theorem [TJ As in the ergodic 
case (considered in Section 0J), we once again employ a quantum Feinstein 
Lemma, which is a generalization of Lemma Q] and is given by the following 
lemma. 



Lemma 10 For all e > 0, there exists n € N such that there exist at least 

, on H® n and positive opera- 
<n) < 1 and 



N = N n > 2 n P(*)- e ] product states p^ ] 
tors E[ n \ . . . , on K® n such that £ 



k=l 



Tr 



> 1 - e 



(48) 



for all k = 1, . . . , N. 



The proof of this lemma is given in Section [531 It uses the idea of adding 
a preamble to the codewords (as was done in [7]) to distinguish between the 
different classes of the Markov chain. The construction of the preamble is 
discussed in detail in the following section. 



5.2 Construction of a preamble 

To distinguish between the different classes, $q , of the quantum channel 
$, we add a preamble to the input state encoding each message in the set 
M. n . This is given by an m-fold tensor product of suitable states (as described 
below). Let us first sketch the idea behind adding such a preamble. Helstr0m 
[TT] showed that two states U\ and cr 2 , occurring with a priori probabilities 
7i and 7 2 respectively, can be distinguished with an asymptotically vanishing 
probability of error, if a suitable collective measurement is performed on the 
m-fold tensor products erf m and erf m , for a large enough m G N. The optimal 
measurement is projection-valued. The relevant projection operators, which 
we denote by Il + and Il~, are the orthogonal projections onto the positive 
and negative eigenspaces of the difference operator A m = 7 1 o r f m — 72crf m . 
Here we generalize this result to distinguish between the different classes . 
If the preamble is given by a state u® m , then, by using Helstr0m's result, 
we can construct a POVM which distinguishes between the output states 
:= (a>® m ) corresponding to the different classes $^' ) . The outcome 
of this POVM measurement would in turn serve to determine which class of 
the channel is being used for transmission. 

We first show that there exists a preamble that can distinguish between 
the different classes, analogous to the branches in [7j. In fact, we want 



15 



to do more. In the case of periodic classes, we also want to distinguish 
between initial states of the class. We therefore subdivide the problem into 
the following four possibilities: 

1. To distinguish between two aperiodic classes; 

2. To distinguish between an aperiodic class C and an initial state i' of a 
periodic class C'\ 

3. To distinguish between two periodic classes C and C'\ and 

4. To distinguish between the states of a single periodic class. 

We refer to the aperiodic classes and the periodic classes with given initial 
state, as branches of the channel. 

Consider the first problem: distinguishing between two aperiodic classes. 
We can obviously assume that the ^ for some n: otherwise the 
classes are identical and we can combine their probabilities. This means that 
for any pair of aperiodic classes C, C there exists n = n{C, C) and a state 
uP c , such that ^c\uP Cf ) + $g ] 

i^cc')- I n f &ct ' m most cases we can take 
n = 1, and we shall assume this for simplicity in the following, even though 
this is not necessary. 

Introducing the fidelity of two states as in [T8] . 

F(a, a') = Tr Va 1 / 2 ^ a 1 / 2 , (49) 

we then have 

F($gVc,c), ^M) < / < 1, (50) 

for all pairs C, C with C < C in some arbitrary ordering of C apeT , the set of 
aperiodic classes. 

The following lemma shows that the classes C and C can be distin- 
guished. 

Lemma 11 For any two aperiodic classes C and C , 

- as m - oo. (51) 



Proof. Choose a > so small that 1 + a < f 1 . First let k be so large that 
(1 - a)jj < <(l + a) 7i (52) 
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for all i,j e I. Now let {£ r } r be a POVM such that 

F(a, a') = J2 VTr^Tr^), 



(53) 



(see e.g. Eq.(9.74) in [IS]) where we denote 

a = $)9(u)c,c') and a' = ^j§(u c ,cj 



Then we have 

/' / ,T> (rnk+rn) / ®(mk+m)\ ^(rnk+rn)/ ®(mfc+m) 

b I <P C (w C)C , J,$ c , (u cc , 

m 

(mk+m) / Cg>(mfc+m) 



< 



ri,...,r„ 



Tr I <I' r UV.r' 



x Tr I 



(mfc+m) / ®(»nA;+m) n, 



C,C" 



) ® lfc) 

i=l 

m 

® ® 1/ 

i=l 



1/2 



< ^ (1 + a) m - 1 J] ^Tr (a^J Tr (<r'£ r 

ri,...,r m i=l 

= (l + a) m - 1 F(a,(T , ) m ^0. 



(54) 



(55) 
□ 

Next consider the second case, i.e., to distinguish an aperiodic class C 
and an initial state i' of a periodic class C . There exists a state u = iOcy 
on 7i such that 

/:=F($g ) (^ cy ),$ J ,(^cy))<l. (56) 

Lemma 12 Lei C &e an aperiodic class and C a periodic class with length 
L = L(C), let i' e C , and choose uj = uJc,i> above. Then 



F ($P(u® m ), $^,(^® m )) -> 



(57) 



as m — > oo. 



Proof. We proceed as in Lemma [TT] and choose a > so small that 1 + a < 
f^ 1 and let k be so large that (1321) holds and in addition such that A; is a 
multiple of L. Again, we let {S r } r be a POVM such that 



F(a, a') = \M (aS r )Tr(a'S r ), 



(5? 
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where now 
Then 



a = $g(u c ,i>) and a' = $i>(u}c,i')- 



(59) 



("OA' K U CA> 



c 

ri,...,r„ 



Tr $ 



(mfc+m) / Cgi(mfc+m)\ 



C 



(g)(^r, ® lfc) 



i=l 



1/2 



< ^ (i+«)^nv Tr K) Tr (^ 



ri,...,r„ 



asm-* oo. 



(60) 
□ 



Distinguishing two periodic classes is straightforward: 



Lemma 13 If C and C are two different periodic classes with periods L{C) 
and L(C) respectively, then there exists a state 0Jq C , on 7i® L , where L = 
L(C) L(C) such that 

F (4 mi) ((^) 0m ),^ L) ((^) 0m )) - (61) 

as m — > oo. 

Proof. Since the two periodic classes are distinct, there exists a state uj = 
ojq C , such that 

a = ^ L \J c %)^^(J c %) (62) 

(In fact we can take L to be the least common multiple of L(C) and L(C').) 
Then writing u = ujc,c ® tp® k , where (p is an arbitrary state on TC and k is 
so large that ( |52l) holds, 



F $ 



C 



< 



< (l + arF^^.^Mj^O. (63) 



□ 
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Finally, to distinguish the initial states of a given periodic class C, notice 
first of all that the corresponding CPT maps $j need not all be distinct! 
However, we may assume that there is no internal periodicity of these maps 
within a periodic class; otherwise the class can be contracted to a single such 
period. This means, that for any two states £ C there exists I < L(C) — 1 
such that <3>j + z 7^ ^V+z. Then choose u = 00^ such that 

/:=F($ i+ ,H,^ +J (a;))<l. (64) 

Lemma 14 If C is a periodic class with period L(C), i,i' £ C and uj is a 
state as above, then 

F ($gj (u® m ), $g (c^ m )) ^ asm ^00 (65) 



Proof. 



/•w$& ) fw® m ),$5$(w e9m ; 



< [F($ w H,$^M)r = r-o. 



Ffe L ](^ L ),$^ ; (^ 



(66) 
□ 

We now introduce, in each of the four cases, difference operators Aqq,, 
Aq 1 ),, A^) with i,i' in a periodic class, and corresponding projections IT 

and Hf it onto their positive and negative eigenspaces , which serve to 
distinguish the different possibilities, as in [7J. The difference operators are 
defined by 



± 

C*,C"> 



= lc(^o ] ^c,c')T m - 7c(*&W)) 

41 = 7c(4 m W)f m - 7<<(*&W)) 



0rn 



and 



The following lemma was proved in [7j: 
Lemma 15 Suppose that for a given S > 0, 



(67) 
(68) 

(69) 



iTrP^I 



(7c + 7c) I < S. 



(70) 
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Then 

|Tr[nJ c ,($( m )(^™))]-l|<^- (71) 



and 



|IY[n- c ,(ct>^ «?))] -l|<2^- (72) 



Here c, c' denote either two different classes C, C" or one aperiodic class 
C and an initial state i' in a periodic class, or two different initial states in 
the same periodic class. 

To compare the outputs of all the different branches of the channel, we 
define projections flj on the tensor product space JC® mM where 

M = M 1 + M 2 + M 3 + M 4 , (73) 

with 

1. Mi is the total number of pairs of aperiodic classes; 

2. M 2 is the total number of pairs of periodic classes; 

3. M 3 is the total number of pairs of aperiodic classes and intial states of 
periodic classes and 

4. M 4 is the total number of pairs of states in the same periodic class. 

We introduce an arbitrary order on the classes C G C assuming C < C if 
C G C apcr and C G C pcr . Then we put 

{I m if d 7^ c and c" ^ c 
n- c if c" = c ^ (74) 

IT+ C „ if d = c. 

It follows from the fact that IT J C „U.~, c „ — 0, that the projections n c are 
also disjoint: 

n ci fl C2 = for ci ^ c 2 . (75) 
We use the following lemma. 
Lemma 16 For all aperiodic classes C , 

~Ilc® { c M) > (mM) )] =1 > ( 76 ) 
and /or all periodic classes C and all i G C , 



lim Tr 

m— >oc 



lim Tr 

m^oo 



n c ,,$g M) (a;^ M ))l =1. (77) 
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Proof. Notice that for all (c, c'), 



F( 7c $( mM )(^ iC O 0m ,7 c '<f? M Vc, c ') 0m ) 

= v^F($( mM )K c o,$? A/) K c O) - o 

as m — >• oo. Using the inequalities 



(78) 



Tt(A 1 ) + Ti(A 2 )-2F(A 1 ,A 2 ) < WA.-A^ 

< Tr (Ax) + Tr (A 2 ) 

for any two positive operators A% and A 2 , we find that 

|^(l45i)-(Ti + 7i)|<^, 
where <5 m — > as m — > oo, since 

Tr = ll7 c ^ M) KcO^-7 c '^ M) (^, c O 0m ||i. 

We now replace m by ml = m + k, where k £ N is large enough so that 
holds, and define 



(79) 



10) 



(81) 



(m'M) 



®(m+fc) 
ci,c 2 > 



(82) 



(ci,c 2 ) 



Using ( 1521) to separate the different classes, we then have for any C E C, 



aper ; 



to. 



®(m+fe) 



ci,c 2 



\ei<c 2 



1 > Tr 

> n Tr n c',c(^ n) (^ m c)) 



C £ Caper fC r< C.C 

>< n Tr [ n ^»(^ m) «c-)) 

C" £ Caper jC" ^"C 



n ri Tr k,'K m) «?)) 



since <5 m — > as m — > oo . The last inequality follows from Lemma H51 



The analogous result, (1771) . for periodic classes, is proved in a similar 
manner. □ 
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5.3 Proof of Lemma [in 



Given 5 > 0, we now fix m so large that 

Tr 



for all C G C aper and 



Tr 



> i 



> 1 -5 



(84) 



55) 



for all C G C per and z' G C". Here M is given by (!73|) . The product state 
ct/ m ° M ), defined through (1H21) . is used as a preamble to the input state encod- 
ing each message, and serves to distinguish between the different branches 
of the channel, i.e., between $c, C G C aper and $c',i, C G C per and i G C". 
If pjj. G B(H® n ) is a state encoding the /c t/l classical message in the set Ai n , 
then the k th codeword is given by the product state 

JmoM) ^ p {n) _ 

We follow the same steps as in the proof of Theorem 5.1 in [7j. First we 
fix l large enough, and an ensemble {pf , pf ^} such that 



cm- /\x { i o) ({pf o) , P f o) }) 



< 



(86) 



As in the ergodic case (Section H]), let N = N(n) be the maximal number 



of product states p^ , . . . , p y ^ on 7i® n for which there exist positive operators 
E[ n \ on }C® moM <g> /C® n such that 



» 



i) 4 n) = E 



ceCa 



n 



c 



+ Ecetw n c,' ® and 



(n) 



EiLi < P ^ Eti K}< < Pi',n for i' £ C £ C pcr , and 
£ 7c Tr [(n c ®^)4 m ° M+n) (^ (moM) ®4 n) 

C €:Caper 

+ E E ^ ,Tr [ ® 4 n c) $ cv M+n) (^ (moM) ® # 

> 1 — e, and 

7 cTr [ (He ® £$)4 w ° M+n) (u/™° M ) ® ^) 

Caper 

+ E E ^' Tr [ o^.? ® 4?)* c moM+n) (^ (moM) ® p (n) ) 

C'^Cpcr i £C 

< 2 - n [C($)-Ie]_ 



n(n) 



111 
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Note that, as in the ergodic case, we can append l>- m '°) to all POVM 
elements, to reduce the proof to the case n = m/ - In the following we 
therefore assume n = ml Q for simplicity. 

The typical projection Pc, n for an aperiodic class is defined as before by 
Lemma HI For a periodic class C we define the typical spaces by interlacing 
those for the product channels $f n (i G C), as follows: 

Lemma 17 Let C be a periodic class with period L. Given e,5 > 0, there 
exists m[ G N such that for m > vn! x there are subspaces ~M™ e C /C z ® m 
(i G C ), (n = mlo), with projections P^ n such that 

Pi^C'APkDhn < 2- m ^-f], ( 8 7) 

where 



L-l 

L 

i=0 

and 



Tr (^clM^n) >l~S 2 . (81 
Proof. We simply let A/*^" be the subspace spanned by the vectors \ipi,ki) 



■ ■ •<g#i+z (n-i),fc„)> where \ip ijk ) is an eigenvector of $§J and \^i,k 1 )^\^i,k L+1 )^ 

■■■ ® l^i,fc [(n _ 1)/L]L+1 ) belongs to the typical space for $^(p /o ), \A+ite) ® 

|^+i,fc L+1 ) ® • • • ® \^i+i,k [(n _ 1)/L]L+2 ) to that of $c°} +1 (pi ), etc. □ 
Similarly we have: 

Lemma 18 Let C be a periodic class with period L. Given i G C , and a se- 
quence i = (ji, ...,j m ) G {1,2,..., J} m , let P>f = Pfc\i)j be the projection 
onto the subspace of K,® n spanned by the eigenvectors of 

m 

■C',i+(r-l)l 

r=l 



with eigenvalues Xj ik = Y\T=i ^i+(r-i)/ Jr,kr suc ^ that 

1 



log X jik + S C ' 
m - 



< \ (89) 



where 

So = Jim ± Y,Pi )s ((*«' ® • • • ® ® 



P. 



(io)- 



.7 m. 



i'ec j 



(90) 
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For any 5 > there exists m' 2 such that for m > m' 2 , 



E Tr 



(mlo) 



Ah) pin) 



>l-5 2 



(91) 



Note that Sc can be equivalently expressed as 

S C ' = Hm Si', 



i'eC 



with Sv as in ( 1131) . 

The remainder of the proof is identical to that of Theorem 5.1 in [7j. For 
each c = C or c = (C, i') with i' 6 C" 6 C per , and j = (ji, . . . , j m ), we define, 
as before 



TV 



1/2 



N 



1/2 



(92) 



fc=i 



k=l 



Clearly K5 } <Pi n) - Ef=i4" } - 



Put 



vf : = e n<,®v§>+ e £^®v&V ( 93 ) 



C6C a 



This is a candidate for an additional measurement operator, E^L, for Bob 
with corresponding input state p^ +1 = pj 1 = p^f 1 <S> Pj 
the condition (i) [see below (1851) ]. is satisfied and we also have 



^ "f^P^®^---®^- dearly 



Lemma 19 

E 7cTr [ (fie ® vg])$T' M+n U m ' M *>pf o [n/lo] 



CGCa 



+ E [^® v (Soj)^'^( w(m '^®^ [B/ '° 1 

< 2 -n[C(*)-§e] 



(94) 



7i , = 1/L(C), for i' E C E C per . 

Proof. Writing <Jq = $^ (p^), by the proof of Lemma [TJ the following 
inequality holds for an aperiodic class C, for n large enough: 

Tr(4 n V^) <2- n &°~* e \ (95) 
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where xc — Xc^ is given by (J9l) , for the maximising ensemble, with n = Iq 
[c.f. P}]. 
Then 



ceCa 



< E ^ Tr t4 n) vS^ 

CGCap or 

< E 7c2" nte "^ ] 



(96) 



ceCa 



where we used the obvious fact that lie < 1 an d (i94jl . 

Similarly, for i' 6 C" G C per , denoting Q nji / = X^fcLi ^£i'> we nave ; using 
Lemma [13 

and hence 

— Tr U( n ) rp( n ) _n uppWpWpW /■p(«) _ n u/2 

~~ 11 °C',i'\- r C',i' Vn.t'J r C,i' r i',i r C ,i'V C ,i' ^£n,i> ) 

< 2- m ^-M Tr [(pW - Qn^P^P^ - Qn,>) 1 ' 2 

< 2~ m ^ Sc '~^Tr (P ( ,™-) 

< 2 _n[l o (S ' c "~' 5c ' ) ~^ 1 

< 2~ n[ *y^\ (97) 



where (see (jllj) ) 



»ec 



In the second last inequality of (|97j) . we use the fact that Tr (P^) < 

2 TraS c"+ e / 4 ; which is a standard consequence of Lemma HE1 We obtain the 
last line of (1971) by using the subadditivity of the von Neumann entropy, as 
in (EE}- 

Summing flDTl) over %' and C", and adding to the bound for C G C aper , 
yields the following bound: 

LHS of flUD < E 7c2"" [ ^ 0) "^ 



+ E E^' 2 " 



(9* 



25 



cm < a # + 



c&c 



e 

6' 



Now by §6 



and hence 



for all CsC, and therefore (194j) follows. □ 
By maximality of iV it now follows that the condition (ii) above cannot 
hold and as before we get, upon taking expectations, 

Corollary 2 

J2 ic E (Tr [ (n c ® V$)$® moM+n ® pf } 

CfE Caper 

< 1 - e. (99) 
We also need the following analogue of Lemma [HJ 
Lemma 20 Assume rf > 35. Then, for n large enough, 

7cTr [ (fie ® Pc,nP^Pc^T 0M+n) (^ (m ° M) ® pf 

er 



> 1 - 7/ 



(100) 



Proof. This is a simple consequence of LemmaEland its analogue for periodic 
classes, together with and (I53|) . □ 



Lemma 21 Assume rf < and write 



N 



N 



k=l 



Qn,c = E E Vc ( C e C a P cr) and Q n , v = ^ eQ (i' G C G C per ). (101) 
fc=l 

JTien /or n /arge enough, 

E 7cTr [ (fto ® Q„,c)$ S m ' M+ " (u> (m ' M) ® pf ^ 

Cfz Caper 

+ E E^' Tr [(ncv®Qn,iO^ M+ "(^ (m ' M) ®pf)] >(V) 2 - 

(102) 
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Proof. This is analogous to Lemma [9j □ 
It now follows, as before, that for n large enough, N(n) > (rj') 2 2 n ^ c ^'~3 e K 
We take the following states as codewords: 



p (moM+n) = w ( m „M) 8 ^W 



(103) 



For n sufficiently large we then have 

N = N n+moM = N(n) > (V) 2 2 n ^~^ > 2 ( m oM+n)[C(*)-e]_ ( 1Q4 ) 

To complete the proof, we need to show that the set {E^}^ =1 satisfies 
. However, this follows immediately from condition (ii) (after eq. flHol) ): 

Tr |~<|)( m oAf+n) ^(moM+n)^ ^(n) 



7cTr [$(? oM+ ") Li^M) p{n)\ E (n) 



ceCa 



+ E E^ Tr 



(m M+n) ( ^(rn a M) ^ ~(")^ 



£ 7cTr [(fl c ® fl£>) d>g 



ceCa 



+ E E ^ Tr [(nc.< ® *S° M+n) (V moM) ® 4 n) 

> 1-e. 



(105) 
□ 



6 Proof of the converse part of Theorem U 

In this section we prove that it is impossible for Alice to transmit classical 
messages reliably to Bob through the channel $ defined by ([3]) and (J4j) at 
a rate R > C(<3>). This is the (weak) converse part of Theorem [TJ in the 
sense that the probability of error does not tend to zero asymptotically as 
the length of the code increases, for any code with rate R > C($). To prove 
the weak converse, suppose that Alice encodes messages labelled by a G M. n 
by states pa in B{H® n ). Let the corresponding outputs for the class C of 
the channel be denoted by a^ c , i.e. 
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Further define 



Let Bob's POVM elements corresponding to the codewords p^ be denoted by 
£'i Tl ' ) , a = 1, . . . , |.M n |. We may assume that Alice's messages are produced 
uniformly at random from the set M. n . Then Bob's average probability of 
error is given by 

:= 1 - J- £ 1* . (108) 

We also define the average error corresponding to the class C of the channel 

as 

P"S == 1 " mT^ E Tr WG'SW] , (109) 

so that 

# = E^S- (no) 

cec 

Let X^™) be a random variable with a uniform distribution over the set 
M-ni characterizing the classical message sent by Alice to Bob. Let 
be the random variable corresponding to Bob's inference of Alice's message, 
when the codeword is transmitted through the class C. It is defined by the 
conditional probabilities 

P [Y^ = (3 | iW = a ] = Ti [^{p^E^}. (Ill) 

By Fano's inequality, 

h(p { ;l) + pP e \og(\M n \ - 1) > H(XM | Y^) = H(X^) - H(XV> : Y$>). 

(112) 

Here h(-) denotes the binary entropy and H(-) denotes the Shannon entropy 
By the Holevo bound, for C G C aper we have 



H(X {n) : Y c 



(n)^ 

c , 



\ 1 1 «eM„ / 1 n| ae.M„ 



\M n 

where Xc ( { ttVt > Pa f ) is S iven b Y CP 
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For C G C per , with period L, 

£ E iE»S(^))-i^ E «(|E W> 
= j^n E 4E WJiiijsi E lE« 
^ ud^ E E5(*g(P<">)iir^T E W» 



eE*S(<^>> 



L 

iec 

= ^>({^,P<">}). (I") 

In the above, we use the convexity of the relative entropy S(a\ \u) := Tr cx(log a— 
logo;), for density matrices a and to. 

Therefore, for any class C we have the upper bound 

H(X (n) . y W } < ^ (n5) 

Inserting this into Fano's inequality, (11121) . now yields 

MpS)+pSlogl^nl>logl^|-nxcf|n^'Pi n) i V ( 116 ) 



However, since 



and R = - log |A4 n | > C($), there must be at least one class C such that 

pS>l-^f^>0. (118) 
We conclude from ffTTUD and ffTTSl) that 

Pt' > (i - A 7C . (U9) 

□ 
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Remark 

Note that the strong converse property [H [23] does not hold for general 
Markovian channels. For example, for a convex combination of memoryless 
channel^: 

M 

$W(p(«)) = ^ 7i $f«(pH), (120) 

where $j : B(7i) — > B(), Bob's error probability does not tend to 1 asymp- 
totically in n for a rate R, such that C($) < R < (?($), where 

M 



cm :=V*?, 



i=l 



and x* denotes the Holevo capacity [THl EI] of the memoryless channel $j. 
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Appendix A 

Lemma 22 //$( n ) is a quantum channel with memory of the form (TJ}). Then 
the limit in ^T^\j exists. In particular, the limit in (d)) exists. 

Proof. Denote 

Xn= SUp A^ n) ({p5 n) ,* (n) (pJ n) )})- (121) 

{p ( ;\ P f}cec 

We shall prove that for any 5 > there exist n Q and m such that for all 
n' > n and n > rriQn', \ n > \n' ~ This proves the lemma because 
obviously, < \n < log dim/C, and it follows that 

lim inf \ n > x n < ~ 8 

n— >oo 

2 A classical version of such a channel was introduced by Jacobs [H] and studied further 
by Ahlswede pQ, who obtained an expression for its capacity. 
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and hence lim inf \ n > lim sup Xn' ~ $ where 5 > is arbitrary. 

To prove the statement, let n' be large, and suppose that {p^ 1 \ pf } is 
a maximising ensemble for (11211) . with n replaced by n'. Given n > n', put 
m = [n/n 1 ] and I = n — mn' . Define the states p^ 1 ' = &)™ =x Pjl ® PjmV 
where pj is the reduced state on Then p^ = ®5!L 1 p < - n '- ) <S> p®, with 

p(n') ._ J^jP^j 1 Pj We now write for any class C G C, 



h,...,i m +l£C i' 1 ,...,i' m+1 eC 



7«2 Tim+1 



XaJ? } (zi, i[) ® • • • ® O-J? } (z m , 4) ® Oc (Wl> C+l)> 



:i22) 



where 



i2,...,i„'_i6C 

x($ i ®$ i2 <g> •••®$ l 0(p ( "' ) ) (123) 

and similarly for ajl\i,i'). Let 7 = f\ i£l ji- Using positivity of the density 
operators and the fact that < 1 < 7^/7, we obtain the simple operator 
inequality 

^c\p {n) ) < ^c \p {n,) ) ® • • " ® ^V" ) ® 4V )- (124) 

Inserting this into the definition of S(^ n \p^)) and using the operator mono- 
tonicity of the logarithm and the fact that (7^) is the equilibrium distribution, 
i- e - J2ieiliQij = 7i> we obtain 

S($ { c\p {n) )) > m5($g i ' ) (p ( "' ) )) + S^cV )) + mlog 7 . (125) 
On the other hand, by subadditivity, 

m 

S{^\ P f)) < £ 5(4%?°)) + S(^(pf m+1 )) (126) 

r=l 

so that 

# ({pf^ipf)}) > ^Xn> + ^log 7 , (127) 
for all C e C. □ 
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Appendix B 

Proof of Lemma [4l Let l\ be so large that 

S M (0oo) < r S ^ {h) ) < ^(0oo) + |. (128) 
tl o 

Let f2 = {Afc} denote the spectrum of a^ ll \ and let 71"*. be the projection onto 
the eigenvector with eigenvalue \ k . For any r > and C C X r , put 

?c = ^i® •••8)7r fer , (129) 

(A fcl ,...,A fer )eC 

and define the probability measures v r on Q r and z/qo on f2 N by 

v r {C) = Tr ($ (Ho/l) (p? r/l) )gc) and Voo (C) = <M<7c)- (130) 

By Lemma [3J ^oo is ergodic and by McMillan's theorem [TB] there exists a 
typical set 

TP = {(X kl ,...,X kr )eQ r \ 

2 -r(h KS (v 00 )+e/8) < z/ r ({(A fel , . . . , AfeJ}) < 2~'" ( ' ljfs ( l/ ° o) - e/8) } , 

(131) 

satisfying 

z/ r (T e (r) ) > 1 -5 2 (132) 

for r large enough, where hxsi^oo) denotes the Kolmogorov-Sinai entropy. 
Now, 



hRS^oo) 



inf -H(u r ) < H{v x ) = S{a™) < h (£W(&o) + ^ , (133) 
r r V 8/ 

where H(v) denotes the Shannon entropy corresponding to the probability 
measure v. On the other hand 

h K s{v<x>) > h S M ((poo) (134) 

because, by positivity of the relative entropy, 

S(a {rll) ) 
= -Tr [a (ril) loga {Wl) ] 



< -Tr 



<? {rh) log f [Tr (a^] (7r fcl ® • • • ® vr fcr )) vr fcl <g> • • • <g> 7r fer j 
\fci,...,fc r / 



Tr [a (Wl) 7T fcl g> • • • ® 7r kr ] log Tr [a {Wl) vr fcl g> • • • <g> 7r fcr 



H{u r ). (135) 
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For arbitrary m, let r = [m/Zi] and define 



7T 



-(m) 



(m) 



vr fcl ®---®7r fer ®lGi3(/C® m ), k = {k u ...,k r ). (136) 



Let T^" v = {A; : (Afc 1; . . . , Afc r ) G Te^}, and define 



Clearly, 



Moreover, if k e T { ™\ it follows from ffTBTD . ffT33l) and ffT34l) that 

• s If' 



Tr[^)g TeM ] = ^(TW)>l-5 2 . 



1 r/i 

— logi/ r ({(A fcl ,...,A fcr )}) < ( SmWoo, , 

m m \ li 8 



and 



— logz/ r ({(A 



fel , • • • , A fer )}) > (s M $oo) + U + ~ 



Taking Zi > 3 and m large enough, we obtain 



— logi/ r ({(A fcl , . . . , A fcr )}) + S M (4>oo) 
m 



e 

< 6' 



Now let 



e 

fcer< m) 



(m) 



and assume that /i is so large that di/12 > — log7 m i n , where 7 n 
Note that 7 min > 0. Define 



(137) 



(138) 
(139) 

(140) 

(141) 
A ie j7«- 



(Jill, i 



E E 

7*0**2 ■ ■ ■ Qii-ii' 



MPiX (142) 



Jlj— Jl=l »2,— 

where j = {31,32, ■ ■ ■ ,ji)- Then we can write as in the proof of Lemma [21 

0*2*3 0*4*5 0*2r*2r+l 



E 



7*3 7*5 7 



*2r+l 



*lvi*2r+2 

Xaj^ii.ia) ® ■ ■ • ® ^Ii(i2r-l»*2r) ® 0" (m_ril) ( i 2r+l,^2r+2)- 

(143) 
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Using the positivity of the transition probabilities, we have 

Pmlo°ml Pml < 2~ m ^^l^ . 

By the fact that tt^ is an eigenprojection of a"*-' 1 -' we then have 

Pml CT m l Pml < 7 ^^(^-e/e) ^mlo) ^ 

But 7~[ n < 2~ me l 12 by the above assumption. □ 

Appendix C 

Proof of Lemma [6] In the following, we suppress the dependence on 1$. We 
follow Hiai & Petz [12j, as in Lemma HI Fix I > 12 large enough so that 

jS(E ll0 ) < Sm(Voo) - ^- (145) 

Let yf ] be the spectrum of af o) = $("»)(pJ o) <g> pj )} . . . ® p? o) ). Note that 
Tiu can be represented as a block-diagonal matrix in ® • . j /C^ z with 
spectrum consisting of eigenvalues z/j^ = PjCtj^ with j e {1, . . . , J}', = 
1, . . . , (dim (/Qjy, and atj^ being the eigenvalues of Let 

yi= U ^f- ( 146 ) 

ie{i,...,J}' 

We now define measures for s G N, on (3^) S D Y 

A*-(C0= £ pf^fW), ( 147 ) 
le{i,...,jy l 

where C C (3^z) s , and 

Qc= ^LM ® • • • ® Tj..*. ( 148 ) 

for j = , . . . , j ). (Here 7^ denotes the projection onto the fc-th eigenvec- 
tor of crj .) We also define the projective limit ^ on by 

fi 00 (C)=fi s (C)=^ 0D (q^), (149) 
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for a cylinder set C G {yi) s ■ It follows from Lemma [5] that /x^ is ergodic. 
Define typical sets 



T 



(») 



i C^fei) • • • ; \ a ,k a ) ^ 3^f | 

2- S (^ s (moc)+6/12) < /x s ({(Aj i>fel , . . . , A^J}) < 2- s ^ s ^)- £ / 12 )} , 

(150) 



where hxsi^oo) is the Kolmogorov- Sinai entropy of /ioo. By McMillan's the- 
orem |16j . 



>l-i*> 



for s large enough. Now, 



(151) 



h K s(Voo) = inf -H(fj. a ) 

s S 

< H(ji 1 )=3(E l ) 

< l( y S M (ip oo ) + -^ 

by (1145p . and on the other hand 

oo I 



(152) 



(153) 



by positivity of the relative entropy. 

For arbitrary m we argue as in Lemma [2], and let s = [m/l]. Writing, 
m = si + r, and j = (j u ...,j m ) = {j_ v . . • , J s , j Q ), we have 



^(nUo) = 7r^ )fci ® . . . (g) vr^ ifcs (8) 7T^\ 



(154) 



where nf' is the projection in @^ . =1 /C® r onto the j„-th summand. Let 



r H = fW_ xhen> 















© 


© -!? } 


= Tr 


Erf © <Z^) 




^je{l,...,J} m 






\je{i,...,J} si / 



> 1 - /• (155) 
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Moreover, if (A^, . . . , A^J G 

^ lo §^ ({(^.fcn---.^,*.)}) < ~ (s M (ipoo) - j-^J , (156) 



m 

and 



(158) 



^ lo gA ». ({(^M, • • • , Aj.,*.)}) > ^m(Voo) + (l + y) ^e) • (157) 
Finally define the typical set of indices j: 

T [m] = |j g {1 ? . . . ? J} m | 2 - m ( ff (te})+ £ / 12 ) < p (. m ) < 2 - m (^(te})- e / 12 )| . 

Then for m large enough, 

2 

if P denotes the probability with respect to the ensemble probabilities {pj}/=i- 
Defining 

{rp[m] -r ■ rp[m] 
if^T™, ( j 

we have for (A^, fel , . . . , A^, fe J G 7^°, 

^ lo § A 3 = -^ lo g(P^ ) ---pf) + ^log// s ({(A, :ijfel ,...,A^ i ,J}) 



< -- 



m 



(^oo)-J^)+^(fe}) + ^ 



< -5m + ^, (161) 



and 



^ lo § A 3 = -^ lo s(P^---pf) + -log/x s ({(A j:i , fel ,...,A^ ifc J}) 

1 



> -(^M + -e) (162) 
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for m large enough. Moreover, 













( 


\ 




e 




= Tr 


y 


e 



















^Tr faMipfty. 

jeT [m] 



i 

> 1-5 2 . 



(163) 
□ 
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